Low bit rate coding for speech and audio using mel linear predictive coding (MLPC) analysis
Abstract
This paper proposes a low bit rate coding method for speech and audio based on a new analysis method, mel linear predictive coding (MLPC, or Mel-LPC) analysis. MLPC analysis estimates the spectrum envelope on a mel or Bark frequency scale, which improves spectral resolution in the low-frequency band. It requires about twice the computation of standard LPC analysis. The coding algorithm built on MLPC analysis consists of five key parts: time-frequency transformation, inverse filtering by the MLPC spectrum envelope, power normalization, perceptual weighting estimation, and multi-stage vector quantization. In subjective experiments, we evaluated MLPC analysis through paired comparison tests against standard LPC analysis in the inverse filtering stage. At all bit rates, almost all listeners judged the signals decoded with MLPC analysis to be superior to those decoded with standard LPC. The difference was especially large at low bit rates.
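The abstract does not give the details of the MLPC algorithm, so the following is only a minimal sketch of the general idea of linear prediction on a warped frequency axis, not the authors' method. It assumes the common warped-LPC construction: the unit delays in the autocorrelation are replaced by a first-order all-pass filter, and the resulting warped autocorrelation is passed to the usual Levinson-Durbin recursion. The function names (`allpass`, `warped_autocorr`, `levinson_durbin`) and the warping constant `alpha = 0.42` (a value often quoted as approximating the mel scale at 16 kHz sampling) are assumptions made for illustration.

```python
import numpy as np

def allpass(x, alpha):
    """Filter x with the first-order all-pass D(z) = (z^-1 - alpha) / (1 - alpha z^-1)."""
    y = np.zeros(len(x))
    prev_x = 0.0
    prev_y = 0.0
    for n, xn in enumerate(x):
        # Difference equation: y[n] = -alpha*x[n] + x[n-1] + alpha*y[n-1]
        y[n] = -alpha * xn + prev_x + alpha * prev_y
        prev_x, prev_y = xn, y[n]
    return y

def warped_autocorr(x, order, alpha):
    """Warped autocorrelation r[k] = <x, D^k x> for k = 0..order.

    With alpha = 0 the all-pass is a plain unit delay, so this reduces to
    the standard (unnormalized) autocorrelation used in ordinary LPC.
    """
    x0 = np.asarray(x, dtype=float)
    xk = x0.copy()
    r = np.empty(order + 1)
    for k in range(order + 1):
        r[k] = np.dot(x0, xk)
        xk = allpass(xk, alpha)  # one more warped "delay"
    return r

def levinson_durbin(r, order):
    """Solve the Toeplitz normal equations; returns (a, err) with a[0] = 1."""
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient for stage i
        a_prev = a.copy()
        a[1:i] = a_prev[1:i] + k * a_prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

# Usage on a synthetic frame (hypothetical data, not from the paper).
rng = np.random.default_rng(0)
frame = rng.standard_normal(320) * np.hamming(320)
ALPHA = 0.42  # assumed mel-like warping constant for 16 kHz sampling
coeffs, residual_energy = levinson_durbin(warped_autocorr(frame, 10, ALPHA), 10)
```

Each extra lag costs one pass of the all-pass filter over the frame, so the warped autocorrelation does more work than the standard one. Whether the paper's MLPC uses exactly this construction is not stated in the abstract.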
Related papers
Entropy Coding of Spectral Envelopes for Speech and Audio Coding Using Distribution Quantization
Speech and audio codecs model the overall shape of the signal spectrum using envelope models. In speech coding the predominant approach is linear predictive coding, which offers high coding efficiency at the cost of computational complexity and a rigid systems design. Audio codecs are usually based on scale factor bands, whose calculation and coding is simple, but whose coding efficiency is low...
Adaptive forward-backward quantizer for low bit rate high-quality speech coding
A novel variable rate linear predictive coding (LPC) parameter quantization scheme is proposed in which linear prediction is done by using either the current (forward LPC) or previously decoded (backward LPC) speech blocks. The proposed LPC quantization scheme was integrated into the FS1016 Federal Standard CELP coder. Significant LPC bit rate reduction is achieved without compromising the decod...
Audio Re-Synthesis based on Waveform Lookup Tables
Transmitting speech signals at optimum quality over a weak narrowband network requires audio codecs that must not only be robust to packet loss and operate at low latency, but also offer a very low bit rate and maintain the original sound of the coded signal. Advanced speech codecs for real-time communication based on code-excited linear prediction provide bandwidths as low as 2 kbit/s. We prop...
Speech Compression Using Linear Predictive Coding
The aim of the project is to develop a system for encoding good quality speech at a low bit rate. To implement this, we have used a powerful speech analysis technique called Linear Predictive Coding (LPC). It uses a 10th-order Levinson-Durbin recursion algorithm to accomplish the task. It provides extremely accurate estimates of speech parameters, and is relatively efficient for computation. The s...
Speech Compression Using Linear Predictive Coding(lpc)
One of the most powerful speech analysis techniques is the method of linear predictive analysis. This method has become the predominant technique for representing speech for low bit rate transmission or storage. The importance of this method lies both in its ability to provide extremely accurate estimates of the speech parameters and in its relative speed of computation. The basic idea behind l...